APPLICATIONS OF THE CLASSICAL UMBRAL CALCULUS 
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Dedicated to the memory of Gian- Carlo Rota 

Abstract. We describe applications of the classical umbral calculus to bilinear generat- 
ing functions for polynomial sequences, identities for Bernoulli and related numbers, and 
Kummer congruences. 



1. Introduction 

In the nineteenth century, Bhssard developed a notation for manipulating sums involv- 
ing binomial coefficients by expanding polynomials and then replacing exponents with 
subscripts. For example, the expression (a + 1)" would represent the sum Y17=o (?)^«- 
Blissard's notation has been known variously as Lucas's method, the symbolic method (or 
symbolic notation), and the umbral calculus. We shall use Rota and Taylor's term "clas- 
sical umbral calculus" [37] to distinguish it from the more elaborate mathematical edifice 
that the term "umbral calculus" has come to encompass [32, 33, 35]. 

The goal of this article is to show, by numerous examples, how the classical umbral 
calculus can be used to prove interesting formulas not as easily proved by other methods. 
Our applications are in three general areas: bilinear generating functions, identities for 
Bernoulli numbers and their relatives, and congruences for sequences such as Euler and 
Bell numbers. 

The classical umbral calculus is intimately connected with exponential generating func- 
tions; thus a" = ttn is equivalent to 

n=0 

and multiplication of exponential generating functions may be expressed compactly in 
umbral notation: 



n\ ) \'- — ' n\ I — ' n\ 

n=0 ^n=0 n=0 



is equivalent to (a + 6)" = c" 
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When I first encountered umbral notation it seemed to me that this was all there was 
to it; it was simply a notation for dealing with exponential generating functions, or to 
put it bluntly, it was a method for avoiding the use of exponential generating functions 
when they really ought to be used. The point of this paper is that my first impression was 
wrong: none of the results proved here (with the exception of Theorem 7.1, and perhaps 
a few other results in section 7) can be easily proved by straightforward manipulation 
of exponential generating functions. The sequences that we consider here are defined by 
exponential generating functions, and their most fundamental properties can be proved in 
a straightforward way using these exponential generating functions. What is surprising 
is that these sequences satisfy additional relations whose proofs require other methods. 
The classical umbral calculus is a powerful but specialized tool that can be used to prove 
these more esoteric formulas. The derangement numbers, for example, have the well- 
known exponential generating function X^^g DnX^ /n\ = /{1 — x) from which their basic 
properties can be derived; umbral calculus gives us the more interesting but considerably 
more recondite formula Y.n=o Dlx'^/nl = Y.t=o ^- ^V(l + x)^>'+^. 

Wc begin in the next section with a description of the classical umbral calculus, following 
Rota [34] and Rota and Taylor [36, 37], and point out some of the minor ways in which 
we differ from their approach. Next, in sections 3 through 5, we consider bilinear and 
related generating functions for Charlier and Hermite polynomials, and some variations. 
In section 6 we derive a bilinear generating function for the Rogers-Szego polynomials, 
which are related to g-Hermite polynomials. In section 7 we apply the umbral calculus 
to identities for Bernoulli and related numbers. Sections 8 through 10 deal with Kummer 
congruences and with analogous congrTicnccs for Bell numbers. 

The next section contains a formal description of the classical umbral calculus as used in 
this paper. The reader who is not interested in these technicalities may wish to go directly 
to section 3. 

2. The classical umbral calculus 

Most users of Blissard's symbolic notation have viewed it as simply a notational conve- 
nience, requiring no formal justification. Thus Guinand [23], in explaining the interpreta- 
tion of umbral symbols, writes: "In general, any step in manipulation is valid if and only 
if it remains valid when interpreted in non-umbral form." However, in 1940 E. T. Bell [5] 
attempted to give an axiomatic foundation to the umbral calculus. To the modern reader. 
Bell's approach seems ill-conceived, if not completely incomprehensible. A much more suc- 
cessful explanation was given by G.-C. Rota in 1964 [34]: When we interpret (a -|- 1)" as 
Y17=o iD'^i^ applying the linear functional on the algebra of polynomials in a that 

takes a' to a^. In retrospect. Rota's idea seems almost obvious, but we must remember 
that in Bell's day the concept of a linear functional was not the familiar notion that it 
is in ours. The seemingly mysterious "umbral variable" a is just an ordinary variable; it 
is in the invisible, but otherwise unremarkable, linear functional that the meaning of the 
umbral calculus resides. The "feeling of witchcraft" that Rota and Taylor [37] observe 
hovering about the umbral calculus comes from the attribution to umbrae of properties 
that really belong to these linear functionals. As in stage illusion, misdirection is essential 
to the magic. 
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Rota and Taylor's recent works [36, 37] expanded on Rota's original insight and in- 
troduced new concepts that help to resolve some of the ambiguities that may arise in 
applications of the traditional notation. However, I shall use the traditional notation in 
this paper. What follows is a short formal description of the classical umbral calculus as 
used here, based on Rota and Taylor's formulation, but with some modifications. 

In the simplest applications of the classical umbral calculus, we work in the ring of 
polynomials in one variable, e.g., R[a], where R is a ring of "scalars" {R is often a ring of 
polynomials or formal power series containing the rationals) , and we have a linear functional 
eval : R[a] R. (This notation was introduced by Rota and Taylor [36].) The variable 
a is called an umbral variable or umbra. There is nothing special about it other than the 
fact that the linear functional eval is defined on R[a]. We will often use the same letter 
for the umbra and the sequence; thus we would write for eval (a"). It is traditional, and 
convenient, to omit eval and to write a" = a„ instead of eval (a") = a„. However when 
following this convention, we must make clear where eval is to be applied. The rule that 
we shall follow in this paper is that eval should be applied to any term in an equation that 
contains a variable that has been declared to be umbral. It should be emphasized that 
this is a syntactic, not mathematical rule, so the formula a"^ = n is to be interpreted as 
eval(a") = n for all n, even though for n = 0, a does not "appear" on the left side. One 
important difference between our approach and that of Rota and Taylor [36, 37] is that 
they require that eval(l) = 1, but we do not, and in sections 7 and 9 we shall see several 
examples where eval(l) = 0. This involves some notational subtleties discussed below; 
nevertheless, there is no reason why a linear functional on polynomials cannot take 1 to 0, 
and there are are interesting applications where this happens. 

We shall often have occasion to deal with several umbrae together. It should be pointed 
out that although we use the symbol eval for whatever linear functional is under discussion, 
there are really many different such functionals. When we write a" = and 6" = 6„ we are 
really talking about two different linear functionals, evali : R[a] R and eval2 : R[b] R, 
where evali(a") = a„ and eval2(6'*) = The meaning of eval(a"*6") might be determined 
by a completely different linear functional on R[a, b], but traditionally one takes the linear 
functional evals defined by eval3(a™f)") = evali(a'") eval2(6"). In this case, we say that the 
umbrae a and b are independent (even though we are really dealing with a property of the 
linear functional evals rather than a property of the variables a and 5). In fact, applications 
of umbrae that are not independent in this sense are uncommon and do not seem to be 
have been considered before, and we shall assume that our umbrae are independent except 
where wc explicitly state otherwise. Nevertheless we give an example in section 5 of an 
application of umbrae that are not independent. 

Eschewing the requirement that eval(l) = 1 entails an additional interpretative issue that 
must be mentioned. We cannot assume that there is a "universal" evaluation functional 
that applies to every term in a formula; instead we may need a different functional for each 
term, corresponding to the variables that appear in that term. In section 9, for example, 
we have the formula 

pn ^ - {AB + C)", 
involving the umbrae F, A, B, and C, which must be interpreted as 

evali(F") = eval2(2y4") - eval3((45 + C)"), 
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where evali is defined on Q[F], evah is defined on Q[^], and evala is defined on Q[S,C]. 
Although the rule may seem unnatural when stated this way, in practice the interpretation 
is exactly what one would expect. 

We will often find it useful to work with power series, rather than polynomials, in our 
umbrae. However, if f{u) is an arbitrary formal power series in u and a is an umbra then 
eval(/(a)) does not make sense. Let us suppose that i? is a ring of formal power scries in 
variables x,y, z, . . . . Then we call a formal power series /(n) G ^[M] admissible if for every 
monomial x^y^ • • • in i?, the coefficient of x^y^z^ • • • in f{u) is a polynomial in u. Then 
if f{u) = J2ifi^^ is admissible, we define eval(/(a)) to be /j eval(a*); admissibility of 
/ ensures that this sum is well defined as an element of R. More generally, we may define 
admissibility similarly for a formal power series in any finite set of variables with coefficients 
involving other variables. 



In the next three sections we apply the classical umbral calculus to find bilinear gener- 
ating functions. More specifically, we find explicit expressions for generating functions of 
the form ^^anbnx'^''/nl, where there are simple expressions for the generating functions 
^^ttnx'^/nl and ^^6nx"/n!. Although it is not obvious a priori that such explicit ex- 
pressions exist, they do, and they have important applications in the theory of orthogonal 
polynomials (see, e.g., Askey [3]). The method that we use can be translated into a tradi- 
tional analytic computation, since in all cases that we consider in these three sections, eval 
can be represented by a definite integral (though in some cases the radius of convergence 
of the series is 0). For example, in this section we consider the umbra A evaluated by 
eval(^") = a{a + 1) • • • (a + n — 1). We could define eval analytically by 



and do all our calculations with integrals. In fact this idea has been used, in a significantly 
more sophisticated setting, by Ismail and Stanton [25, 26, 27] to obtain bilinear generating 
functions much more complicated than those we deal with here. 

The rising factorial («)„ is defined to be a{a+l) ■ ■ ■ (a+n — l). The Charlier polynomials 
Cn{x;a) are defined by 



A:=0 ^ ^ 

(see, for example, Askey [3, p. 14]), but it is more convenient to work with differently 
normalized versions of these polynomials, which we define as 



3. Charlier polynomials 



1 r°° 

evaI(/(A)) = /(X) 



•X' 



dx 





i=0 



Let us define the umbra Ahy A"^ = (a)„. Then 

Cn{u,a) = {A + ur. 



(3.1) 
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OO y 



^A. = Y,A-±^ = = (1 - ^)~"' (3-2) 

n=0 ■ n=0 

by the binomial theorem. So 

Y: Cniu, a)^= e(^+«)^ = e-e^^ = (3.3) 

n=0 ^ ' 

Our goal in this section is to prove the bilinear generating function for the Charlier 
polynomials, 

n=0 fe=0 ^ ' ^ ' 

To do this we first prove some properties of the umbra A. 
Lemma 3.1. For any admissible formal power series f, 



Proof. First we prove the lemma for the case f{z) = e^"'. We have 



(1 — y — w)°' 
1 1 

W 



exp ( -^w ] . (3.4) 



(1 - y)" 

by (3.2). Equating coefficients of w'^/k\ shows that the lemma is true for f{z) = . The 
general case then follows by linearity. 

Alternatively, we could have introduced an umbra F with e^^ = f{z) and replaced w 
with F in (3.4). □ 

As a first application of Lemma 3.1, we prove the following little-known result. 
Theorem 3.2. 

^2m{u, a) -e 2^ iuxfk^<^ k\ ■ 

m=0 fe=0 ^ ' 
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Proof. We have 

' ^ ml ' ^ ml 



m=0 



m=0 



(1 - 2mx)" 



exp 



1 — 2ux 

k 



by Lemma 3.1 



X 



k=0 



(1 - 2Mx)2fe+« /c! 



□ 



By a similar computation we can prove a generalization given by the next theorem. We 
leave the details to the reader. 



Theorem 3.3. 



rf.ni „.n 

^-^ ^ ml nl ^-^ (1 

m,n=0 



(a) 



2k 



k=0 



(1 - 2ux - yfk+a 



□ 



Next we prove the bilinear generating function for Charlier polynomials. An equivalent 
formula can be found in Askey [3, p. 16, equation (2.47)] with a minor error; a and b must 
be switched on one side of the formula as given there for it to be correct. A combinatorial 
proof of our Theorem 3.4 has been given by Jayawant [28], who also proved a multilinear 
generalization. 



Theorem 3.4. 



n=0 



k=0 



Proof. Let A and B be independent umbrae with A" = (a)„ and = {f3)n- Then there 
is an analogue of Lemma 3.1 with B replacing A and (3 replacing a. 
We have 



n=0 



^vx ^Avx ^{Bu+AB)x 



= e 



(1 — vxY 



exp ( Bux + 



A 



1 — vx 



Bx I by Lemma 3.1 



„uvx 

^ Rut 

-e exp 



(1 - ra)" 



A 



1 — vx 



-Bx 



(1 — vx)°' (1 — ux)^ \1 — vx 1 — ux 



A 



B 



X J by Lemma 3.1 
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^ uvx W)k 

^ (1 - vxY+'^ (1 - uxY+P k\ ■ 

□ 

The polynomials Cn{u, a) have a simple interpretation in terms of permutation enumer- 
ation: the coefficient of a^u^ in Cn{u — a, a) is the number of permutations of {1,2,... , n} 
with j fixed points and i cycles of length at least 2. This follows easily from the exponential 
generating function 



^ — X \OlOO 

) =y^ Cniu- a,ay- 
l — X / ^-^ 



n=0 

(See, for example, Stanley [39, chapter 5].) In particular, C„(— 1,1) is the derangement 
number the number of permutations of {1,2,... , n} with no fixed points, and Theo- 
rems 3.2 and 3.4 give the formulas 

v^n m x^ 

m=0 fe=0 ^ ' 

and 



x". 



Vl>2— = e^V- 
n=0 k=0 ^ ' 

Theorem 3.4 can be generalized to a formula involving 3-line Latin rectangles. See [21] 
for a combinatorial proof that also uses umbral methods. A more general result was given 
using the same technique by Zeng [45], and using very different techniques by Andrews, 
Goulden, and Jackson [2]. 

4. HERMITE POLYNOMIALS 

We now prove some similar formulas for Hermite polynomials. Perhaps surprisingly, the 
proofs are a little harder than those for Charlier polynomials. We first define the umbra 
M by 

e^^ = e-^', (4.1) 

so that 

I 0, if n is odd. 

(The reason for the minus sign in this definition is so that we can obtain formulas for 
the Hermite polynomials in their usual normalization.) There are two basic simplification 
formulas for M: 



Lemma 4.1. 

(i) e^'" = 
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(ii) For any admissible formal power series f, we have 

e^yf{M)=e-y'f{M-2y). 



Proof. For (i), we have 

= E(-i 



A;! k\ Vl + 4x ■ 



e 

fe=o 



For (ii), as in the proof of Lemma 3.1, it is sufficient to prove that the formula holds for 
f{z) = e^'^. In this case we have 

□ 

Lemma 4.2. 

^Mx+M-'y _ ^ 

Although Lemma 4.2 can be proved directly by showing that both sides are equal to 

^ ■+fc (2j + 2fc)! x^^ 
2-^ ^ (j + fc)! (2j)!A;!' 

we give instead two proofs that use Lemma 4.1 

First proof. If we try to apply Lemma 4.1 directly, we find that the linear term in M does 
not disappear, so we need to use a slightly less direct approach. We write eJ^^^^ v as 
^M{x-}-z)^M'^y-~Mz ^ where z will be chosen later. Now applying Lemma 4.1 gives 

^M(x+z) ^M'^y-Mz _ ^-{x+zf ^{M-2x-2zfy-{M-2x-2z)z 

_ ^-(x+z)'^ ^M'^y-4M{x+z)y+A{x+z)'^y-Mz+(2x+2z)z 

We now choose z so as to eliminate the linear term in M on the right; i.e., we want 
— 4(x + z)y — z = 0. So we take z = — 4a;y/(l + Ay), and on simplifying we obtain 
^Mx+M^y ^ g-xV(i+42/)+M2y rj.^^^ applying Lemma 4.1 (i) gives the desired result. □ 

Second proof. Let us fix y and set g{x) = q^^+^'^v^ Applying Lemma 4.1 directly gives 

^(•3,^ ^ ^Mx+M^y ^ ^-x'^ ^{M-2xfy 

Iterating and taking a limit yields 

^(3.) ^ g-a;2(l-42/)-42x2j/2(l-42/)— • ^ ^-x^l-4y+4:^y^-4:^y^+-) ^^q-^ 

by Lemma 4.1 (i). □ 
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Now we define the Hermite polynomials Hn{u) by the generating function 

y HJu)^ = e^"^-^ = e(2«+^)^ (4.2) 



n=0 



so that Hn{u) = {2u + My. 

First we prove a well-known analogue of Theorem 3.2, a special case of a result of Doetsch 
[10, equation (10)]. 



Theorem 4.3. 



n=0 

Proof. We have 



^ " n! 



n=0 



, ''^ e^"^^ exp { ^ \ by Lemma 4.2 

1 / ^u^x 



□ 



By the same reasoning we can prove the following generalization of Theorem 4.3. 
Theorem 4.4. 

rr . 1 f 4u'^x + 2uy-y^\ 
V H2m+n{u)—r—r = , , exp — — . □ 

Equating coefficients of y"/n! in both sides of Theorem 4.4, and using (4.2) yields 

?TI=0 ^ * ^ ^ ^ 

which is the general form of Doetsch's result [10]. 

We state without proof a "triple" version of Theorem 4.3 that can be proved by the 
same technique. See Jayawant [28], where umbral and combinatorial proofs are given. 

Theorem 4.5. 

~ X" _ e^^'^+^^^^'"' ^ (-l)"(6n)! x^^ 

3n(«) - ^1 ^ 48tix)V4 (3n)! (1 + 48ua;)3"/2 (2n)! ' 

w/tere = + A8ux - l) /(24x). □ 



Next we prove Mehler's formula, which gives a bilinear generating function for the Her- 
mite polynomials. An elegant combinatorial proof of this formula has been given by Foata 
[13], and generalized to the multilinear case by Foata and Garsia [14, 15]. 
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Theorem 4.6. 

n=0 V \ / 

Proof. We use two independent umbrae, M and N, with M as before and AT" = M" for 
all n. (In the terminology of Rota and Taylor [37, 36], M and N are "exchangeable" 
umbrae.) Then 

oo „ 



n=0 



g2u(2j;+Ar) x gM(2j;+A'')a; 



^ g2«(2j;+JV)Xg-(2«+JV)2a;2 ^^ -^^ 

^4:vx{u—vx) ^2Nx{u—2vx)—N'^x'^ 

^4vx{u-vx) / 4^x'^(u — 2vx)'^^ 

— ■ exp I ^ 1 by Lemma 4.2 



Vl^^ ^\ 1-4x2 

1 — (u^ + f2)x2 

: exp 4 



VI - 4x2 V 1-4x2 

□ 

5. CaRLITZ and ZEILBERGER'S HERMITE POLYNOMIALS 

Next we consider analogues of the Hermite polynomials studied by Carlitz [8] and Zeil- 
berger [44]. Carlitz considered the "Hermite polynomials of two variables" 

miii(m,n) 

,m—k„,n—k 



k=0 

with generating function 



Ex™ 
777.1 r).\ 

m,n 

and proved the bilinear generating function 

EHm,niui,Vi)Hm,niu2,V2)—r—r 
m 77, 



,ux+vy+xy 

m\ n! 



m\ ni 

m,n=0 



x_l f UlU2X + VlV2y+{uiVl+U2V2)xy\ 

= (l-xy) exp(^ j. (5.1) 

Independently, Zeilberger considered the "straight Hermite polynomials" 

min(m,n) / \ / \ 
fe=0 V / V / 
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with generating function 



ml ni 



and gave a combinatorial proof, similar to Foata's proof of Mehler's formula [13], of the 
bilinear generating function 

\^ u ( \u I n x-i (x + y + {u + v)xy\ 

> Hm,n{u)Hm,n{v) — — = (1 - uvxy) exp . (5.2) 

■"^^ ml ni V 1 — uvxy J 

m,n=o ^ y / 

It is easy to see that Carlitz's and Zeilberger's polynomials are related by Hm,niu,v) = 
u"^v"'Hm,n{^/uv), and that (5.1) and (5.2) are equivalent. We shall prove (5.2), since it 
involves fewer variables. Our proof uses umbrae that are not independent. 
We define the umbrae A and B by 

A^B^ = dm,nml, 

where 5m,n is 1 if m = n and otherwise. Equivalently, A and B may be defined by 

^Ax+By ^ ^xy_ (5 3) 

Then Zeilberger's straight Hermite polynomials are given by Hm,n{u) = {l + A)'^[l + Bu)"'. 
Two of the basic properties of these umbrae are given in the following lemma. 

Lemma 5.1. 

(i) If f{x, y) is an admissible power series then e^^'^^^ f{A, B) = e^^f{A + r,B + s). 

(ii) e^^+By+ABz ^ _J_^xy/{i-z) _ 

Proof. As in the proof of Lemma 3.1, it is sufficient to prove (i) for the case f{x, y) = e^'^'^v", 
and for this case we have 

^Ar+Bs j^j^ ^-j _ ^Ar+Bs^Au+Bv _ ^A{r+u) ^B{s+v) _ ^{r+u){s+v) 

by (5.3), and 

grs _|_ ^ _|_ _ ^rs ^{A+s)u+{B+r)v _ ^rs+rv+su^Au+Bv 

We can prove (ii) by using (i) to reduce it to the case x = y = 0, but instead we give a 
direct proof. We have 

Ax+By+ABz _ ST^ Ai+k r>j+k^^ 

= V ^ (^^)' = 1 ,xyl(l-z) 

^ (1 - z)J+i j\ 1-z 

□ 
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Now we prove Zeilberger's bilinear generating function (5.2). We introduce two independent 
pairs of umbrae Ai,Bi and A2, B2 such that each pair behaves hke A, B; in other words, 

AlB{A^B^ = Sk,i5m,nk\ml 

Then 



m,n=Q 



X"" y" 



m „,n 



= y (1 + Aini + + A2r{i + ^2^;)"^^ 

ml n! 

m,n 

^ ^{1+Ai){l+A2)x+{1+Biu){l+B2v)y 

^ g(l+A2)a;+(l+B2i')2/gAi(l+A2)a;+Bi(l+B2u)wj/_ 



Applying (5.3) with Ai and Si for A and 5 yields 

^{l+A2)x+{l+B2v)y+{l+A2){l+B2v)uxy _ ^x+y+uxy ^A2x{l+uy)+B2vy{l+ux)+A2B2Uvxy 

Then applying Lemma 5.1 (ii) yields (5.2). 

By similar reasoning, we can prove a generating function identity equivalent to the 
Pfaff-Saalschiitz theorem for hyper geometric series [22]. In terms of Carlitz's Hermite 
polynomials of two variables, this is the evaluation of 



y^-f^m,n+j(0, l)Hm+i,niO, 1) - — 

ml ni 



Theorem 5.2. Let i and j be nonnegative integers. Then 

'm + i\(n + j\ ^„ {l + xy{l + yy 



„ , n J \ m 

m,n=U 

Proof. With Ai, Bi, A2, and B2 as before, we have 



(1 + Bi)^+^A1^{1 + B2r+' = ml n! ^ ^ , 
so the left side of (5.4) is equal to 

y AT{i + si)"+^A^(i + B2r+'^^ 

ml nl 

m,n 

^ gAi{l+B2)x+A2(l+Bi)y(;L ^ ^2)^(1 + Biy. (5.5) 

Multiplying the right side of (5.5) by u^v^ /iljl, and summing on i and j, we obtain 

^Ai{l+B2x)+A2{l+Bi)y+{l+B2)u+(l+Bi)v ^ ^u+v ^Ai(l+B2)x+Bi{v+A2y)+A2y+B2U 

Applying (5.3), with Ai and Bi for A and 5, gives 

^u+v ^{l+B2){v+A2y)x+A2y+B2U _ ^u+v+xv ^A2{l+x)y+B2{u+xv)+A2B2xy 
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Applying Lemma 5.1 (ii), we obtain 

^u+v+xv f (I + x){u + xv)y\ 1 fil + x)v + {l + y)u 

exp = exp 

1 — xy \ 1 — xy J 1 — xy \ 1 — xy 

and extracting the coefficient of u^v^ gives the desired result. □ 

We can also prove analogues of Doetsch's theorem (Theorem 4.3) for the straight Hermite 
polynomials. We need the following lemma, which enables us to evaluate the exponential 
of any quadratic polynomial in A and B. 



Lemma 5.3. 



^Av+Bw+A^x+ABy+B^z _ ^ cxp 



vw{l — y) + v^z + uP'x 



V'(l - y f - V (1 - y) - 

Proof. Since the proof is similar to earlier proofs, we omit some of the details. The case 
V = w = Q is easy to prove directly. For the general case, we write e.^'v+Bw+A x+ABy+B z 
as 6"^''+^* • Q-A'^-Bs+Av+Bw+A x+ABy+B z ^^^^ choose r and s so that when Lemma 5.1 (i) 
is applied, the linear terms in A and B vanish. We find that the right values for r and s 
are 

v{l — y) + 2wx ^ w{l — y) + 2vz 

(1 — y)2 — 4xz (1 — j/)^ — 4xz ' 

and the result of the substitution is 

' vw{l-y)+V^Z + Upx \ A^^+ABy+B^z 

(1 - y)2 -Axz J 

which may be evaluated by the case v = w = 0. □ 
Theorem 5.4. 



exp 



^ — ^ m 1 r) 1 



ml ni 

m,n=0 



oo 



E^m yU 2 / X + y + Auxy \ 



> Hm,m{u)—r = exp 

^-^ ml 1 — ux \1 — ux 

m=0 

Proof. For the first formula, we have 

'^^-^ tYl ft OO 'Y)1 111 

ml nl ^-^ ml n\ 

m,n=0 m,n=0 

We simplify this with Lemma 5.3. The proofs of the other two formulas are similar. (The 
third formula is equivalent to a well-known generating function for Laguerre polynomials.) 

□ 

By the same reasoning, we can prove a more general formula that includes all three 
formulas of Theorem 5.4 as special cases. 
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Theorem 5.5. 

/ (1 + + (1 + uf + 4ttxy + (1 — ■uz)(t; + i/; + z + \ 

6. ROGERS-SZEGO POLYNOMIALS 

Next we give a proof of a bilinear generating function for the Rogers-Szego polynomials, 
which are closely related to g-Hermite polynomials. Our proof differs from the other proofs 
in this paper in that it uses a linear functional on a noncommutativc polynomial algebra. 
A traditional proof of this result can be found in Andrews [1; p. 50, Example 9] which is 
also a good reference for basic facts about g-series. 

In this section we let {a)m denote the g-factorial 

{a)^={\-a){\-aq)---{\-aq^-^), 
with (a)oo = liinm->oo(oi)m as a power series in q. In particular, 

{^q)^ = {\-q){\-i)---{\-qn- 

The g-binomial coefficient [^] is defined to be {q)n/{Q)k{Q)n-k- The Rogers-Szego polyno- 
mials Rn{u) are defined by 



Rn{u) = ^ 
fe=0 '- 

We will use a g-analogue of the exponential function. 



oo ^ 
X^' 



n=0 



n=0 (^)- 



We will also need the g-binomial theorem 

oo 

E 

the special case a = gives 

(^)oo 

from which it follows that e{q^x) = {x)je{x). 

If A and B are noncommuting variables satisfying the commutation relation BA = qAB, 
then it is well known that 

n 



(6.1) 



{A+Br = Y^ 

k=0 

and it follows easily from (6.1) that e((A + B)x) = e{Ax)e{Bx), where x commutes with 
A and B. We shall also need the easily-proved fact that B^A^ = q^^A^BK 
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Now let A, B, C, and D be noncommuting variables such that BA = qAB, DC = qCD, 
and all other pairs of variables commute. We shall work in the ring of formal power series in 

A, B, C, D, with our ring of scalars (which commute with everything) containing variables 
u, V, X and q. We define our evaluation functional by eyal{A^ B^ C'' D'-) = n'w^. 

Since we need to do some of our computations in the ring of formal power series in A, 

B, C, and D, we write out the applications of eval explicitly in this proof. 

Theorem 6.1. 



oo 



^^Rn{u)Rn{v) ^^^^ (™x)oo(«a;)oo(ra)oo(a;)oo' 
Proof. By (6.1), 

(CO „ \ oo „ 

Y^iA + BnC + Dr-^]=Y,Rn{u)Rn{v)-^. (6.2) 
n=0 ^^^"/ n=0 ^^^n 

Also, we have 

oo 

^{A + Bf{C + = e{{A + B){C + D)x) 

n=o 

= e{A{C + D)x)e{B{C + D)x) 

= e{ACx)e{ADx)e{BCx)e{BDx). (6.3) 

The only variables "out of order" in this product are the D's and C's in e{ADx)e{BCx), 
so 

evai{e{ACx)e{ADx)e{BCx)e{BDx)) 

= eval(e(^Cx)) eval {e{ADx)e{BCx)) eval{e{BDx)) 
= e{uvx) Q\a\{e{ADx)e{BCx))e{x) 
eYa\{e{ADx)e{BCx)) 



We have 



so 



{x)oo{uVx)c 



eiADx)eiBCx) - V _ V "^'^'^'^'1''^'^' 

e[ADx)e[BCx) - ' ^ 



(6.4) 



^ _(^^^)^ ^ - (^ 

£^ (g)i(rag*) 
1 (ura^)c 



yoo 



- . (6.5) 

[VX)^ [uX)oo 

The theorem then follows from (6.2), (6.3), (6.4), and (6.5). □ 
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It is worth pointing out that although our proof uses noncommuting variables, it does 
not yield a noncommutative generalization of the result, since the last application of eval 
is necessary for the final simplification. 

7. Bernoulli numbers 
The Bernoulli numbers Bn are defined by the exponential generating function 

oo „ 

BW = ^fi„^ = — , (7.1) 

n=0 

Since (7.1) implies that e^B{x) = x + B{x), the Bernoulli umbra B defined by 5" = S„ 
satisfies 

{B + ir = B^ + 5n-i, (7.2) 

where 5m is 1 if m = and is otherwise. Prom (7.2) it follows by linearity that for any 
admissible formal power series /, 

f{B + l) = f{B) + f'{0). (7.3) 

Formula (7.3) may be iterated to yield 

f{B + k) = f{B) + /'(O) + /'(I) + • • • + f'ik - 1) (7.4) 

for any nonnegative integer k. 

There are three other important basic identities for the Bernoulli umbra. Although the 
most straightforward proofs use exponential generating functions, the umbral proofs are 
interesting and arc therefore included here. Very different umbral proofs of these identities 
have been given by Rota and Taylor [37, Theorem 4.2 and Proposition 8.3]. 

Theorem 7.1. 

(i) {B + 1)" = (-5)". 

(ii) (— -B)" = B"' for n ^ 1, with Bi = — ^. Thus B^ = when n is odd and greater 
than 1. 

(iii) For any positive integer k, 

kB"^ = {kBY + {kB + lY + --- + {kB + k- ly. 
Proof. We prove "linearized" versions of these formulas: for any polynomial /, we have 

f{B + l) = f{-B) (7.5) 
/(-i?) = /(S) + /'(0) (7.6) 
kf{B) = f{kB) + f{kB + 1) + . . . + f{kB + k-l) (7.7) 

Pirst note that (7.6) follows immediately from (7.5) and (7.3). We prove (7.5) and (7.7) by 
choosing polynomials /(x), one of each possible degree, for which the formula to be proved 
is an easy consequence of (7.3). 



APPLICATIONS OF THE CLASSICAL UMBRAL CALCULUS 17 

For (7.5), we take f{x) = — {x — 1)", where n > 1. Then 

f{B + l) = {B + ir-B^ = 5n-i by (7.2), 
and since f{—x) = {—l)"'~^f{x + 1), we have 

f{-B) = (-1)"-V(S + 1) = i-ir-'Sn-l = Sn-l = f{B + 1). 

For (7.7), we take f{x) = {x + 1)" - x", where n > 1. Then f{B) = and 
fc-1 fc-1 fc-1 

f{kB + i) = Y,{kB + i + lT- Y,{kB + 1)" 

i=0 i=0 i=0 

= {kB + kT - {kB)-^ = k''{{B + 1)" - 5") 

= k^6n-l = kSn-1 = kf{B). 

□ 

For later use, we note two consequences of Theorem 7.1. First, combining (7.4) and 
(7.6) gives 

fc— 1 

f{B + k)-f{-B) = Y,f'ii)- (7.8) 

i=l 

Second, suppose that f{u) is a polynomial satisfying f{u + 1) = /(— n). Then we have 

/(5) = i(/(2B) + /(2S + l)) by (7.7) 
= i(/(2B) + /(-2S)) 

= /(2S) + /'(0) by (7.6). (7.9) 

Next, we discuss an identity of Kaneko [29], who set Bn = {n + l)Bn and gave the 
identity 

n+l , 1 1 \ 

B„+, = o, (7.10) 

noting that it (together with the fact that B2j+i = for j > 0) allows the computation 
of B2n from only half of the preceding Bernoulli numbers. Kancko's proof is complicated, 
though his paper also contains a short proof by D. Zagier. We shall show that Kaneko's 
identity is a consequence of the following nearly trivial result. 

Lemma 7.2. For any nonnegative integers m and n, 

g(7)«...(-.r-E(;0- 

Proof. Take f{x) = x'"(x - 1)" in (7.5). □ 
The key to Kaneko's identity is the observation that 



m+j- 



n + 1\ 

^ _ Bn+i = [n + l) 



n + l\ ( n 
+ 



,i - 1 

which reveals that (7.10) is simply the case m = n + 1 of Lemma 7.2. 



Bn+i, (7.11) 
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We can generalize Kaneko's identity in the following way: 
Theorem 7.3. 



1 g 3„+i-. fn + 1\ ^^^^ ^ _ 
^ J2 4"+i- + ^) = (-1)"(4" + (2 - |n)3") , 



=0 

and in general, 

^ p + j ^ J2 ((2n + l)z - (n + l)k)e{i - kT'K (7.12) 

Proof. Using (7.11), we see that the left side of (7.12) is 

{B + kT+^B'^ + B"+i(B + kf. 
Setting f{x) = x'^{x — A;)" in (7.8), we have 

k-l 

{B + fe)™5" - (-l)"^+'^5™(5 + fc)" = ^((m + n)i - km)i"'-\i - fc)"" 



\iu -r 'tjt — Klltj l yi — Kj 

Setting m = n + 1 gives (7.12). 



□ 



There are several interesting identities for Bernoulli numbers that actually hold for any 
two sequences (cn) and {dn) related umbrally by = (c+ I)'*; i.e., 



n 

' n 



i=0 ^^-^ 

We note that (7.13) may inverted to give = (d — 1)"; i.e.. 



i=0 

By Theorem 7.1 (i), (7.13) holds with = S„, dn = (-1)"'S„. We shall next describe 
several pairs of sequences satisfying (7.13), and then give some identities for such sequences, 
which seem to be new. 

Since (7.13) is equivalent to 

/ , '^n r = 6 y Cji j- , 

n=0 n=0 

it is easy to find sequences satisfying (7.13) with simple exponential generating functions, 
though not all of our examples are of this form. 
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The derangement numbers D„ satisfy n! = Yll=o {T)^i C^-^^) holds with = Dn, 
dn = n\. 

For any fixed nonnegative integer m, the StirUng numbers of the second kind S{m,n) 
satisfy n'" = Y17=o (")^! S{m, i), so (7.13) holds with c„ = n! S{m, n), dn = n^. 

The Euler numbers are defined by EnX^/nl = sechx. Let us define the "signed 

tangent numbers" T„ by tanhx = -^"■^"/'^" Then since e^sechx = 1 + tanha;, we 

have that (7.13) holds with c„ = d„ = 5„ + T„. 

The Genocchi numbers Qn are defined by Yl^=o Qnx"^ / = 2x/{e^ + 1). Then 

2xe^ ^ 2x 
= 2x- 



+ 1 + 1 ' 

so (7.13) holds with Cn = gn, dn = 26n-i — gn (so that di = gi = 1). 
The Eulerian polynomials An{t) satisfy 



oo 



Mt)-T = 



n=0 

Then Aq = 1, and An{t) is divisible by i for n > 1. Let us set An,{t) = t~^An{t) for n > 0, 
with Ao{t) = 1. It is easy to check that 

oo ^ oo n 

n=Q ' n=0 

SO (7.13) holds with c„ = A(t)/(1 - t)", d„ = i„(t)/(l - t)". 

The Fibonacci numbers F„ are defined by Fq = 1, Fi = 1, and F„ = Fn-i + Fn-2 for all 
integers n. It is easily verified that for every fixed integer m, (7.13) holds with = F^^n, 

dn = Fin+2n; fUld also witll Cn = Fm-n, dn = F^+n- 

By the Chu-Vandermonde theorem, (7.13) holds with 



where («)„ = a(a + 1) • • • (a + n — 1). 

As Zagier observed [29], it is easy to characterize the pairs of sequences satisfying (7.13) 
with dn = (— l)"c„, which, as we shall see, give analogues of Kaneko's identity. The 
condition, with c{x) = e*^^ and d{x) = e*^^, is e^c(x) = c(— x), which is equivalent to 
e^/^c(x) = e^^/^c(— z); i.e., e^/^c(a;) is even. Thus it is easy to construct such sequences, 
but not many seem natural. In addition to the Bernoulli numbers, we have an example 
with the Genocchi numbers gn, 



2e-^/2 



gn+1 X' 



^ + 1 n + 1 n! ' 

ra=0 

and one with the Lucas numbers, c„ = (— 2)^"'(L„ + L2n), where L„ = -Fn+i + Fn-i, 

We now discuss the identities which are consequences of (7.13). Our first identity gen- 
eralizes Lemma 7.2. 
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Theorem 7.4. Suppose that the sequences Cn and dn satisfy (7.13). Then for all nonneg- 
ative integers m and n, 



Proof. Let c and d be umbrae with = and = d„. Then (7.13) imphes that 
(c+1)" = d"', so for any polynomial f{x), we have /(c+1) = f{d). Taking f{x) = x"*(x— 1)" 
yields the theorem. □ 

An application of Theorem 7.4 yields an interesting recurrence for Gcnocchi numbers. 
Let Cn be the Genocchi number gn, so that, as noted above, dn = 26n-i — gn- Then taking 
m = n in Theorem 7.4, we have for n > 1, 



E(:)— E(:)(-i)-w 



i=0 ^ ^ i=0 

so 



J2{l + {-lr-l(^fjgn+^ = 0. 



i=0 

The only nonzero terms in the sum are those with n — i even, so we may set 2j = n — i 
and divide by 2 to get the recurrence 

L«/2J . X 

E L?-^2n-2, =0, n>l. (7.15) 



i=o 



Equation (7.15) is known as Seidel's recurrence (see, e.g., Viennot [42]). It implies that 
g2n is an integer, which is not obvious from the generating function (it is easily shown that 
52i+i = for i > 0), and it can also be used to derive a combinatorial interpretation for the 
Genocchi numbers. The reader can check that the Genocchi analogue of Kaneko's identity 
alluded to before Theorem 7.4 is also Seidel's recurrence in the form Y17=o il) 9n+i = (in 
this form true for all n). 

We now derive some further identities for sequences satisfying (7.13), of which the first 
generalizes Theorem 7.4. 

Theorem 7.5. Suppose that the sequences (cn) and (dn) satisfy Y17=o {T)^^ ~ ^^^^ 
for all a and b, 

a\ f b \ f(i\fa + b — j 



t (:) (" : f ) - g („ - j) (" ; <-)-^'"-- ("^) 
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Proof. Let c be an umbra with c" = c„. Then (7.16)-(7.18) follow by substituting c for u 
in the following polynomial identities, where v = 1 + u: 



^. , .n — il ^ — ' \ 7 / \ n — 7 

|:(:)(r-r)<--''-g(„-.)(";>--'"-^'- 
g(:)(rr)(-)-(-')"gG)C::r)<-''- ^-^^ 

We prove (7.19) by extracting the coefficient of in (1 + xf'{l + vxY in two ways. It 
is clear that this coefficient is given by the left side of (7.19). But we also have 

(1 + x)^(l + vxf = (1 + x)\l + X + uxf = (1 + xf^'' ( 1 + , 

V l + x J 

in which the coefficient of is easily seen to be given by the right side of (7.19). 
For (7.20), we extract the coefficient of .x" in 

((1 + xf - 2xvy = ((1 + xf -2x- 2xuy = (1 + - 2xu)''. 

For the left side we have 



((l + x)^-2x.)« = (l + x)-(l-^^)" 

1' ^ 



ij (1 + x)2i 
^^x'{l + x)^''-^\-2vy 



and the coefficient of x"" is the left side of (7.20) . 
For the right side we have 



(1 + - 2xu)" = {^'^ - ^^^y 

i ^ ^ 

-E(:)C)^^'(-w- 

=e(1')G)^*"'(-^'""'- 



Setting i = n — j gives the right side of (7.20) as the coefficient of x". 
For (7.21) we start with the identity 

((1 + xf - AxvY = ((1 + xf -Ax- AxuY = ((1 - xf - Axuf . 
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The coefficient of may be extracted from both sides as on the left side of (7.20). □ 

We note that (7.19) is equivalent to a 2-^1 linear transformation and (7.20) to a 2-P1 
quadratic transformation. Equation (7.21) is actually a special case of (7.19); it can be 
obtained from (7.19) by replacing a with 2a — n and b with n — a — ^, and simplifying. 

The special case a = — 1 of (7.16) is worth noting. It may be written 



±( ' ){-!)% = {-ir±( 

^ \n — ij ^ V 



(7.22) 



If we replace n by m + n in (7.22) and then set 6 = n, it reduces to (7.14). 

Next we prove a remarkable identity of Zagier [43] for Bernoulli numbers. Our proof is 
essentially an umbral version of Zagier's. The reader may find it instructive to compare 
the two presentations. 



Theorem 7.6. Let 



n 

E 

r=0 



n + r\ Br 
2r J n + r 



for n > 0. Then the value of B* for n odd is periodic and is given by 



n (mod 12) 


1 


3 


5 


7 


9 


11 




3 
4 


1 

4 


1 
4 


1 
4 


1 

4 


3 
4 



Proof Since ("2^'') = C2r-i)^ for r > 0, we have 

00 00 r -. n 



n=l 



n=l 

= -log(l 



r=l 

\2 



n + r — 1 
2r- 1 
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X 



x) 



log 1 - S 



(i-xy 



log((l - xf - Bx) 



Now let g{u) = - log(l - ux + x^), so that 2 B*x'' = g{B + 2). Note that 

9'{u) = 



1 — ux + x"^ 

Taking k = 4 and f{u) = g{u — 2) in (7.8), we have 

g{B + 2) - g{-B - 2) = ^'(-1) + ^'(0) + ^'(1) 

X X 



+ 



+ 



1 + X + x"^ 1 + X^ 1 — X + x^ 



But g{-B - 2) 



_ 3x- x-^ - x^ + x^ + x^ -3x 

log ((1 + x f + Bx)=2 E~ 1 B*{-xr, so 
g{B + 2)-g{-B-2) = AY, K^", 



11 



n odd 



and the result follows. 



□ 
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8. KuMMER Congruences 

We say that a sequence (un) of integers satisfies Kummer's congruence for the prime p 
if for every integer n and every j > n, 



f2{-lr-'(%^ip-l)+,^0 (modp-). (8.1) 

i=0 



There are many variations and generaHzations of this congruence, and we refer the reader 
to [19], on which most of this section is based, for more information and further references. 
If we set j = n + k, then (8.1) may be written umbrally as 

(uP-n)V = (modp'*) (8.2) 

for all n,k >0, where u"^ = Um- 

The result that we prove here shows that if a sequence satisfies Kummer's congruence, 
then so does the coefficient sequence of the reciprocal of its exponential generating function. 
Similar results apply to products. 

Theorem 8.1. Let (un) and (vn) be sequences of integers satisfying 

\n=0 / \n=0 / 

Then if (u„) satisfies Kummer's congruence for the prime p, so does {vn)- 

Proof. The relation (8.3) may be written umbrally as {u + w)" = 0, for n > 0, where u and 

V are independent umbrae satisfying u'^ = u„i and v'^ = Vn, and this implies that if f{x) 
is any polynomial with no constant term, then f[u + v) = 0. We shall prove by induction 
that if (8.2) holds for all n,k>0 then 

{yP - v)''v^ = (mod p") (8.4) 

for all n,k >0. 

The case n = of (8.4) is trivial. Now let AT be a positive integer and K a nonnegative 
integer, and suppose that (8.4) holds whenever n < N and also when n = N but k < K. 
Thus 

Q=[{u + vY -{u + v)f{u + v)^ 
= [{uP -u) + {yP -v)+ pR{u, v)f{u + v)^ 

for some polynomial R{u, v) with integer coefficients, 

= {yP — v)^v^ + other terms. 

Here each other term is an integer times {uP — u)"'{vP — v)'^{pR{u,v)^'^u'^v'^, where a + b + c = 
N, d + e = K , and either b<N or b = N,c = 0, and e < K. Thus by the inductive 
hypothesis and (8.2), each of the other terms is divisible by = p'^ ^ and therefore 

{yP — v)^v^ is also. □ 
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As an example, we apply Theorem 8.1 to generalized Euler numbers. Recall that the 
Euler numbers En are defined by sechz = Yl'^=Q^nx"'/n\ (so En = when n is odd). We 

define the generalized Euler numbers by 

°° 'rnn / oo mn \ ^ 

Ve(™)^^ = S"— 

(mn)\ \^(mn)\ ' 

n=0 ^ \n=0 ^ ' / 

(2) 

SO that eh. = E2n- 

Theorem 8.2. Let p he a prime and let m he a positive integer such that d = {p — l)/m 
is an integer. Then for j > n/m, 



i=0 



Proof. Let us take n„ = 1 and Vn = c^nj'ln ^ divides n, with u„ = = otherwise. 
Then the sequences {un) and {vn) satisfy (8.3), and {un) satisfy Kummer's congruence for 
p. Therefore (u„) does also. □ 

For example, if we take m = 4 and p = 5 in Theorem 8.2, then d = 1 and we have the 
congruence 



E(-ir-'( 



_ Je;j,^0 (mod 5") (8.5) 

for j > n/A. 

By the same kind of reasoning we can prove a variation of Theorem 8.1 [19]: 

Theorem 8.3. Let the sequences (un) and (vn) be related by (8.3), and suppose that for 
some integer a, 



to"-'(:) 

i=0 ^ ^ 



Uip+j = (mod 



for all nonnegative integers j and n. Then 



n , 
j=0 ^ 



Viyj^j = (mod p^). □ 



Next we prove a Kummer congruence for Bernoulli numbers. A similar, but weaker, 
congruence was proved by Carlitz [7] using a different method. 

We call a rational number 2-integral if its denominator is odd. If a and b are rational 
numbers, then by a = 5 (mod 2^) we mean that (a — h)/2^ is 2-integral. For example, 
^ = I (mod 2). We define P2(a) to be the largest integer for which a/2''2(") is 2-integral; 

so P2{\) = -1 and P2(|) = 2. 

In the proof of the next theorem we will use the fact that 2i?„ is 2-integral for all n, and 
that if n is even and positive then Bn = \ (mod 1); this follows easily by induction from 
the case A; = 2 of Theorem 7.1 (iii). 
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Theorem 8.4. For nonnegative integers n and j, 

Y^{-iy-'('^\B2i+2o = Q (mod 2--), (8.6) 
where ro,o = 0, Tj^ = ro,n = — 1 for j > and n > 0, Tj^i = 1 for j > 2, and 

Tj,n = min (^2j - 2, 2 

for n > 2 and j > 1. Moreover, the exponent in (8.6) is best possible if and only if 
j7^L(3n+l)/2j. 

Proof. For simplicity we prove only the most interesting case, in which n > 2 and j > 1- 
Let B be the Bernoulh umbra, = Bn, so the sum in (8.6) is (S^ — 1)"B^-'. 
Applying (7.7) with k = 2 and f{u) = {v? — l)"ii^-', we obtain 

(^2 - l^B"^^ = 2'^^-\AB^ - l^B'^^ + 2^'^-^B'^{B + 1)"(2S + ifK (8.7) 

The first term on the right side of (8.7) is (-l)"22i-i(S2j -4nB2i+2 H )• Since j > 0, 

this is congruent to (-l)"22i-2 (^^.^j 2^^) and thus p2(2'^'>-'^{4B^ - lYB"^^) = 2j - 2. 

Next, let g{u) = u^^iu + l)"(2tt + 1)^^. To determine p2{g{B)), we apply (7.3) in the 
form g{B) = g{B — 1) — 5''(— 1) and we find that (since n > 1) 

g{B) = S"(5 + 1)"{2D + if^ = B'^iB - 1)"(2S - ifK 

We now apply (7.9) to f{u) = u"-{u — l)"(2u — 1)^* and we obtain (since n > 1) 

g{B) = f{B) = f{2B) = rB^{2B - 1)"(4S - if^ = (-2)'^(i?, - 2ni?„+i + 2K), 

where K is 2-integral. Thus if n is even, 

g{B) = 2^Bn = 2^-1 (mod 2^), 

and if n is odd 

g{B) = 2'^+^nBn+i = 2'* (mod 2"+^). 

Thus p2{g{B)) is n — 1 if n is even and n if n is odd; so in either case we have p2{g{B)) = 
2\{n — l)/2j + 1. Thus the power of 2 dividing the second term on the right side of (8.7) is 

P2(5(^)) = (2n-l) + 2L(n-l)/2j + l = 2L(3n-l)/2j, 

and the congruence (8.6) follows. It is clear that the exponent in (8.6) is best possible if 
and only if 2j — 2 ^ 2[(3n — l)/2j and this is equivalent to the stated condition. □ 

We can use Theorem 8.4 to obtain congruences of a different kind for the Bernoulli 
numbers. As noted earlier, for sequences (c„) and (d„) we have c„ = X^"=o(~ (i')'^i 
if and only if d„ = X^"=o (i^)^j- fix j > and take dn = B2n+2j, so that c„ = 

{B^ - lYB'^K Then we have 

n 

B2n+1j = 

i=0 



3ra- 1 
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Moreover, it follows from Theorem 8.4 that if z > {2j — l)/3 and i > 2 then Cj = 
(mod 2^-'"^), so we obtain the congruence 

M 



B2n+2j = ^[ )ci (mod 22^-2), (8.^ 



i=0 

where M = max([2(j — 1)/3J,1). The cases j = 2,3,4 of (8.8), with simplifications 
obtained by reducing their coefficients, are 

12 1 

B2„+4 = -— + —n = - + 2n (mod 4) 

1 2 13 . , ^ 

^2n+6 = 42 ~ = Y + lOn (mod 16) 

5,„^3 ^ _ 1 + A, _ ^ f-U 1! + 42n + 48f (mod 64). 



30 55 5005 V2y 2 \2^ 
We note for use in the next section simpler forms of the first two of these congruences: 

Lemma 8.5. Let n be an even integer. 

(i) Ifn>A then Bn = \ + n (mod 4). 

(ii) Ifn>6 then B„ = i + 5n (mod 16). □ 

Of course, more direct proofs of this lemma are possible. Similar congruences for 
Bernoulli numbers to other moduli have been given by Frame [16]. Many congruences 
for generalized Euler numbers, obtained in this way from Kummer congruences, can be 
found in [19]. 

9. Median Genocchi numbers and Kummer congruences for Euler numbers 
It follows from Theorem 8.3 that the Euler numbers En satisfy the congruence 



X:(^)^2.+,=0 (mod 2") 



However, Probenius [17] (see also Carlitz [7]) proved a much stronger congruence: the 
power of 2 dividing 



(for j even) is the same as the power of 2 dividing 2"n!. Using the same approach as 
Probenius and Carlitz, we prove in this section a Kummer congruence for the numbers = 
nEn-i in which the modulus is 2^" or 2^"~^, and derive from it Probenius's congruence. 

A special case of our result gives a divisibility property for the median Genocchi numbers 
(also called Genocchi numbers of the second kind) . These numbers may be defined by 



H2n+l — XI ( I. ) 5'n+fc+l, 
fe=0 ^ ^ 
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where the gi are the Genocchi numbers, defined by J2i^o9i^^/^^- ~ 2a;/(e^ + 1). (In com- 
binatorial investigations, the notation -ff2n+i is usually used for what in our notation is 
|-ff2n+i| = i~^)"'H2n+i-) The Connection between median Genocchi numbers and the num- 
bers Fn = nEn-1 is given by the following result, due to Dumont and Zeng [12]. 



Lemma 9.1. 



Proof. Let us define the umbrae g and F hy g"' = gn and F" = F^ = nEn-i, so that 
gFs _ ^ gggj^ ^ Then 

^ ^ = 2e-^/2 . - sech - = 2e-^/2gf . ^ 2e5(^-i)-. 

e^ + l e=^/2 -I- 2 2 

Thus 5" = 2 ('^^)", and it follows by linearity that 

/ F -i- / \ "-^^ 

22'^i^2n+i = 22"(5 + l)V+^ = 2^'*-2 



2 

= {F- 1){F^ - If = F{F^ - 1)", 
since = for m even. □ 

Barsky [4] proved a conjecture of Dumont that H2n+\ is divisible by 2"~^. More precisely, 
Barsky proved that for n > 3, i^2n+i/2"'^^ is congruent to 2 modulo 4 if n is odd and is 
congruent to 3 modulo 4 if ra is even. Kreweras [30] gave a combinatorial proof of Dumont's 
conjecture, using a combinatorial interpretation of -ff2n+i due to Dumont [11, Corollaire 
2.4]. A g-analogue of Barsky 's result was given by Han and Zeng [24]. 

It is interesting to note that (as pointed out in [38]), a combinatorial interpretation of 
the numbers i?2n+i/2"'~^ was given in 1900 by H. Dellac [9]. Dellac's interpretation may 
be described as follows: We start with a 2n by n array of cells and consider the set D of 
cells in rows i through i + n of column i, for i from 1 to n. Then if2n+3/2"' is the number of 
subsets of D containing two cells in each column and one cell in each row. Dellac did not 
give any formula for these numbers, but he did compute them for n from 1 to 8. Dellac's 
interpretation can be derived without too much difficulty from Dumont's combinatorial 
interpretation, but it is not at all clear how Dellac computed these numbers. 

Theorem 9.2. Let F„ = nEn^i, so that Yl'^=o Fnx'^/n\ = xsechx. Let 2^." he the highest 
power of 2 dividing 



F, 



1=0 

inhere j is odd. Then /ij = 0, fij^i = 2, and for n > 1, pLj^n = Sn if n is odd and 

fJ-j.n = 3n — 1 if n is even. 

Proof. We have 

„ _^ 2 2 + 2e-2^ 2e-2^ 2e-^ 
sech x — 2e ^ = = = — ^ = -2- 



+ e~^ e^ + e~^ + e~^ e^^ -|- 1 e^^ — 1 
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Therefore 

xsedix = 2xe'^ — sinhx ■ B{Ax) (9.1) 

where B{x) = xjie^ — 1) is the Bernoulli number generating function. 
Now let us define the umbrae F, A, B, and C by 

e^^ = X sech x 



n! 

n=l 



e^^ = B{x) = 



- 1 

e*^^ = sinhx = ^ 

n odd 

Then from (9.1) we have 

pn ^ 2^1" - {4B + C)". (9.2) 
We want to find the power of 2 dividing F^{F^ — 1)". It follows from (9.2) that 

i?i(i?2 _ ;L)n ^ 2A^{A^ - 1)" - (45 + Cy{{AB + C)^ - l)". (9.3) 
First note that for any polynomial p, p{A) = p'{—l). Therefore, 

'2(-l)J-ij ifn = 

4(-l)^-i if n = 1 (9.4) 
if n > 2 

We note also that C-^^C^ - 1)" = for all integers j > and n > 1. Then 

{4B + Cy{{AB + (7)2 - 1)" = (4B + Cy{16B^ + SBC + - 1)" 

= 2^" (45 + C)^' (2^2 + 5(7)". (9.5) 

We now need to determine the power of 2 dividing {4B + Cy {23"^ + BC)"^. Since 
F^{F'^ — 1)" = if j is even, we may assume that j is odd. Since 2Bi is 2-integral, we have 

(45 + Cy{2B^ + BCY = &{B'^C'' + 2715^^+^(7"-^) (mod 2). 

Using the facts that 5j = when i is odd and greater than 1, and that 

1 if z is odd 



2A^{A^ -!)" = < 



C 

we find that if n = then 



if z is even. 



(45 + C)^ (252 + 5(7)" = 5o = l (mod 2), 
if n is even and positive then 

{AB + Cy{2B'^ + BCY = Bn = \ (modi). 
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and if n is odd then 

{4B + Cy{2B^ + BC)" = 2nBn+i = 1 (mod 2). 
The theorem then follows from these congruences, together with (9.3), (9.4), and (9.5). □ 

By taking more terms in the expansion of [AB + Cy {2B'^ + BC)'^ , we can get congruences 
modulo higher powers of 2. For example, if n is even then 



{iB + Cy{2B' + BCr = Bn + 4[^] Bn+2 (mod 8) . 



Applying Lemma 8.5 (ii), we find that if ra > 6 then 



Bn + 



4 B„+2 = i + (n + 3) (mod 8) . 



Since n even implies = 2n (mod 8) and 4n = (mod 8), this simplifies to | — 2n 
(mod 8). 

Similarly, if n is odd then 

{4B + Cy{2B^ + BCT = (2n + 4j)Bn+i + "^(fj (mod 8). 

If n > 3 then Lemma 8.5 (i) gives (2n + 4j)S„+i = (n + 2_7)(3 + 2n) (mod 8), and we may 
easily verify that for n odd, 4(3) = 2n — 2 (mod 8). Using the fact that n odd implies 
= 1 (mod 8) and 4ra = 4 (mod 8), we obtain 

(2n + Aj)Bn+i + 4 Q = 4 + 2i + n (mod 8) . 

Therefore we may conclude that (for j odd) if n is even and n > 6 then 

16), 



2-(3n-i) ^(_i)"-* M F2i+j = 4n - 1 (mod 



and if n is odd and n > 3 then 



g(.„.-.(.;) 



^(_i)n-^ ( ^ ) F2i+j ^ 4 - 2i - n (mod 8). 

i=0 

In particular, we get a refinement of Barsky's theorem: If n is even and n > 6 then 
iJ2n+i/2"~^ = 4n - 1 (mod 16), and if n is odd and n > 3 then iJ2n+i/2" = 2 - n 
(mod 8). It is clear that by the same method we could extend these congruences to any 
power of 2. 

Next we derive Probenius's congruence from Theorem 9.2. (This derivation is similar to 

part of Frobenius's original proof.) Define the umbra E by e^^ = scchx, so F"' = nE^~^. 
First note that if j is even then Ej = Fjj^i/[j + 1) is odd, so Frobenius's congruence 
holds for n = 0. From = nE'^~^, it follows that for any polynomial p, we have 
p{F) = p'{E). Let us take p{u) = u-^+^(n^ — 1)", where j is even and n > 1. Then 
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p'{u) = {j + l)u^{u'^ - 1)" + 2nu^+'^{u'^ - 1)"-^ By Theorem 9.2, we have p{F) = 
(mod 23"-^), so 

Ej{E'' - 1)" = --^E^+^{E^ - (mod 2=^"-^). 

By induction on n, the power of 2 dividing E^~^'^{E'^ — 1)"'"^ is equal to the power of 2 
dividing 2'^~^(n — 1)!, and Frobenius's result follows. 

In view of Theorem 9.2, it is natural to ask whether there are analogous congruences for 
generalized Euler numbers. There seem to be many possibilities, but the most attractive 
is given by the following conjecture: Define numbers by 

°° ^(2n+l)m / °° ^2nm 



V f^"')— = _ / V 



i{2n + l)m)\ m\ ^(2nm)!' 

n=0 ' / ' n=0 ^ ' 

(Thus /^') is F2n+i as defined above.) Let 2'*j>'* be the highest power of 2 dividing 



E(-ir-(:)/S- 



i=0 

Then for t > 1, we have /ij,o,t = 0, = 4, and for n > 1, /ij,n,t = \Jy\ " 1) except when 
t = 1, n = 2 (mod 4), and n > 6. 

In the exceptional case, ^i.j;n,i = ^ + 2 + /92(j + "^n)-, where dn is some integer or 2-adic 
integer. The first few values of t?„ (or reasonably good 2-adic approximations to them) are 
i?6 = 118, t?io = 7, i?i4 = 2, t?i8 = 13, t?22 = 32, and i?26 = 27. 

By way of illustration, /xo,i,t = 4 for t > 1 is equivalent to the (easily proved) assertion 

that 1 + (^2?*) is divisible by 16 but not by 32. 

10. Bell numbers 
The Bell numbers -B„ are defined by the exponential generating function 

°° n 

S(a;) = Vi?n^ = e^^-^ (10.1) 

Although we are using the same notation for the Bell numbers that we used for Bernoulli 
numbers, there should be no confusion. Rota [34] proved several interesting properties 
of the Bell numbers using umbral calculus in his fundamental paper. Here we prove a 
well-known congruence of Touchard for Bell numbers and a generalization due to Carlitz. 
Differentiating (10.1) gives B'{x) = e^B{x), so 



k=0 ^ ^ 



With the Bell umbra B, given by = this may be written 5"+^ = (B ^ 1)". Then 
by linearity, for any polynomial j(x) we have 

S/(5) = /(S + l) (10.2) 
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A consequence of (10.2), easily proved by induction, is that for any polynomial f{x) and 
any nonnegative integer n, 

B{B-l)---{B-n + l)f{B) = f{B + n). (10.3) 

(A g-analogue of (10.3) has been given by Zeng [46, Lemma 8].) If we take f(x) = 1 in 
(10.3), we obtain (since Bq = 1) 

B{B -1){B -2)---{B -n + 1) = 1. (10.4) 

We note that Rota took (10.4) as the definition of the Bell umbra and derived (10.2) and 
(10.1) from it. 

As an application of these formulas, we shall prove Touchard's congruence for the Bell 
numbers [40, 41]. 

If f{x) and g{x) are two polynomials in Z[x], then by f{x) = g{x) (mod p). we mean that 
f{x)—g{x) G pZ[x]. We first recall two elementary facts about congruences for polynomials 
modulo a prime p. First wc have Lagrange's congruence, x{x — \) ■ ■ ■ {x — p + 1) = x^ — x 
(mod p). Second, if g{x) € Z[x] then g{x + p) — g{x) = (mod p). 

Theorem 10.1. For any prime p and any nonnegative integer n, 

Bn+p - Bn+1 - Bn = (mod p). 

Proof. By Lagrange's congruence, 

{BP-B-\)B'^ = {B{B - 1) ■ ■ ■ {B - p + 1) - 1)B'' (modp). 

By (10.3), 

{B{B - 1) • • • (B - p + 1) - 1)5" = (S +p)" - 5". 
Since (a; + p)^ — x"" = (mod p), p divides {B + p)^ — B^. □ 

Next we prove a generalization of Touchard's congruence analogous to a Kummer con- 
gruence, due to Carlitz [6] . 

Theorem 10.2. For any prime p and any nonnegative integers n and k, 

{BP-B- 1)^=5" = (mod ). 

Proof. Let L{x) be the polynomial x{x—l) ■ ■ ■ {x—p+l) — l. First we show that it suffices to 
prove that for any polynomial f{x) G Z[x], L{B)^ f {B) = (mod p^^/'^'^). To see this, note 
that we may write L{x) = — .x — l — pR{x), where R{x) G Z[x]. Then [B^ — B — 1)^B^ = 
{L{B) + pR{B)fB'^ = Y!1=q {^i)p'L{Bf-'R{ByB'^, and our hypothesis will show that 
p^LiBf-^RiByB"" is divisible by p to the power i+\{k- i)/2\ = \{k + i)/2\ > \k/2]. 
We now prove by induction on k that for any polynomial f{x) G Z[a;], 

L{Bff{B) = (modprfc/21). 

The assertion is trivially true for k = 0. For the induction step, note that we may write 
L{x + p) = L{x) +pJ{x), where J{x) G Z[x], and recall that by (10.3), L{B)g{B) = 
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g{B +p) — g{B) for any polynomial g. Then for any f{x) G 7,[x] we have for A; > 

L{Bff{B) = L{B) . L{Bf-^f{B) = L{B + pf-^ f {B + p) - L{Bf-^f{B) 
= {L{B) +pJ{B)f-^f{B+p) - L{Bf-^f{B) 

= L{Bf-^ {f{B +p)- m) +J2p'(^~^) L{B)'-'-'J{Byf{B + p). 

We show that each term of the last expression is divisible by p^^/"^^ . Since f{x + p) — f{x) = 
ph{x) for some h{x) G Z[x], we have L{B)'^^^[f{B+p)-f{B)) = pL{BY^^h{B), which by 
induction is divisible by p to the power l+\{k — l)/2] > \k/2\ . By induction also, the zth 
term in the sum is divisible by p to the power z + [(A; — 1 — i)/2] = — 1 + z)/2] > [fe/2]. 
This completes the proof. □ 

Theorem 10.2 can be extended in several ways (in particular, the modulus can be im- 
proved); see Lunnon, Pleasants, and Stephens [31] and Gessel [20], which both use umbral 
methods (though the latter is primarily combinatorial). Another congruence for Bell num- 
bers, also proved umbrally, was given by Gertsch and Robert [18]. 
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